Make JRA55 default forcing dataset #533
Conversation
cice.setup: ERROR, /turquoise/usr/projects/climate/eclare/CICE.v6.1/apcraig-jra55_update/configuration/scripts/options/set_[nml,env].alt01a not found |
sorry about that. I have added that file, it should be fixed now. |
That was quick! Thanks. |
I'm starting to compare the time series plots for the base_suite tests. I'll also look through the log files for inconsistent settings. Should I look at other test suites? |
base_suite is the main one. The other suites are largely either subsets of the base_suite or technical testing that use mainly the default namelist for science. |
I would like to suggest that we change the default value of emissivity from 0.95 to 0.985, when we switch to JRA forcing. It's more realistic. Any reason not to do this (other than making comparing the output more painful)? |
We can certainly change the value of emissivity. I will make that change a separate step and test it. |
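If we go that route, the change could live in its own options file so it can be tested independently. A minimal sketch, assuming the usual one-assignment-per-line set_nml convention; the file name set_nml.emiss985 is hypothetical:

```
emissivity = 0.985
```

This would let the new value be toggled per test before (or instead of) changing the gx1/gx3 defaults.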
I think we should also set nblyr=1 as the default (it appears to be 7 now) |
7bf885b makes jra55 the defaults by updating the gx1 and gx3 defaults, getting rid of the set_nml.jra55* options, and adding gx3ncarbulk and gx1coreii options. The test names were changed to reflect the new options, and all this was compared to the prior baseline using links to address the test renaming, and everything is confirmed to be bit-for-bit, https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#7bf885bb2acfbaa6dd6d1d915d730bd0f0368910. There are 4 new tests added in the new test lists that did not have baselines. This commit leverages "zz" option names to make renaming easier. The next step will be to remove the zz and rerun the test suite to establish new baselines. |
328 measured results of 328 total results. The failing tests are the usual ones calling for 40 pes. The pending test also failed initially: PEND badger_intel_restart_gbox128_2x2_boxadv_short run. I tried to rerun it and killed it after waiting a while; I don't understand what's wrong. It seems to hang and eventually time out, although it initially did start to run and write data to the diagnostic output file. |
I have made time series plots for all of the global and point diagnostics in all of the base_suite runs, for both the current forcing and JRA55. They mostly look okay to me, although there are a few things that need to be looked at more closely and possibly fixed, particularly for the alt0* runs. I'll come back to those. Meanwhile, I'm hoping that the testing/analysis team can answer or address the items below. Attached here are the plots for the 90-day JRA55 runs for both gx3 and gx1.
Notes from global/ (these are hemispheric values produced by print_global=T):
* We should turn on the brine tracer for these tests. When it's off, maybe we shouldn't print this diagnostic.
* Should there be rain water in the Arctic in Jan/Feb/Mar?
* The arwt tot mass, energy, salt diagnostics are wrong.
* Spikes in the error plots could help us locate conservation leaks.
Notes from point/ (these are values for particular lat/lon points produced by print_point=T):
* Arctic SST is wrong: large spikes, 2 lines, y-axis values are the wrong magnitude. This might just be a diagnostic issue.
* Why is the Antarctic shortwave radiation sum = 0 in mid summer?
* Why is the snow change always 0?
The following issue may be more about the physical representation than the diagnostic calculations. From the point plots:
* Why does gx1 Antarctic heat used go to an approximately constant (barely decreasing), nonzero value in mid-Feb? The area fraction is ~0; bottom, top, and lateral melt stop but thickness continues to decrease, and the sensible, latent, and outward longwave fluxes are all nonzero. The gx3 behavior is the opposite: Antarctic heat used is larger after mid-Feb, and the various terms remain nonzero. It could just be that the grid cells we are looking at are experiencing different conditions and hit the ice/no-ice threshold differently, but they should be fairly close to each other (lat/lon), so I wouldn't expect the conditions to be that different.
|
@eclare108213, if the 40-pe cases fail, would it be helpful to limit the number of pes used in testing? What should that number be? 32? 24? 16? Regarding the diagnostics, let me know if there is anything in particular you want me to look at. I hope we can fix the diagnostics as well as the arwt computation asap. |
It's okay with me to keep using the 40-pe tests, just knowing they'll fail on badger. It seems like we added something to the scripts for this kind of machine limitation, but if so, obviously I haven't been using it. Have you run into any issues with the intel_restart_gbox128_2x2_boxadv_short test? Since it's failed repeatedly for me on this PR, I don't think it's a fluke, but it could be a machine-specific problem. If you (or anyone) would like to tackle the diagnostic issues, go right ahead... They're a bit lower on the totem pole for me, personally, but I agree we ought to fix them ASAP. I'll take a look at the alt0* tests next, then I plan to revert the upwind code (unless someone else wants to do that...) :) |
I'll take care of the upwind code, #508. |
Can you share your log file from your 90-day JRA55 gx1 test?
Rick
On 11/30/2020 12:51 PM, Elizabeth Hunke wrote:
I have made time series plots for all of the global and point
diagnostics in all of the base_suite runs, for both the current
forcing and JRA55. They mostly look okay to me, although there are a
few things that need to be looked at more closely and possible fixed,
particularly for the alt0* runs. I'll come back to those.
Meanwhile, I'm hoping that the testing/analysis team can answer or
address the items below. Attached here are the plots for the 90-day
JRA55 runs for both gx3 and gx1.
90day.tar.gz
<https://github.com/CICE-Consortium/CICE/files/5617944/90day.tar.gz>
Notes from global/ (these are hemispheric values produced by
print_global=T):
* We should turn on the brine tracer for these tests. When it's off,
maybe we shouldn't print this diagnostic.
* Should there be rain water in the Arctic in Jan/Feb/Mar?
* The arwt tot mass, energy, salt diagnostics are wrong.
* Spikes in the error plots could help us locate conservation leaks.
Notes from point/ (these are values for particular lat/lon points
produced by print_point=T):
* Arctic SST is wrong: large spikes, 2 lines, y axis values are
wrong magnitude. This might just be a diagnostic issue.
* Why is the Antarctic shortwave radiation sum = 0 in mid summer?
* Why is the snow change always 0?
The following issue may be more about the physical representation than
diagnostic calculations. From the point plots:
* Why does gx1 Antarctic heat used go to an approximately constant
(barely decreasing), nonzero value in mid-Feb? The area fraction
~0; bottom, top & lateral melt stops but thickness continues to
decrease, and sensible, latent heat, outward longwave flux are all
nonzero. The gx3 behavior is the opposite - Antarctic heat used is
larger after mid-Feb, and the various terms remain nonzero. It
could just be that the grid cells we are looking at are
experiencing different conditions and hit the ice/no-ice threshold
differently, but they should be fairly close to each other
(lat/lon) so I wouldn't expect the conditions to be that different.
|
I have updated the PR again only removing the "zz" naming option (08d6f70) to generate a new suite of baselines to continue the process moving forward. Those baselines are https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#08d6f70c6f4ee4994bf811552affc2992a74616e. All tests pass but many of the regression tests are missing because the names have changed. From this point, we could merge the PR and then as a separate PR update some of the tests. Or we could continue to allow this PR to evolve and update tests here before merging. |
Actually, at least one more step is required which is to remove the "extra" old ncarbulk and coreii tests. But I think that could be the last step before we merge. |
Again, so far, all I've done is carefully change the default forcing to JRA55, add optional gx1 coreii and gx3 ncarbulk options, and run full test suites on 3 compilers on cheyenne to make sure it's all working properly. |
I added issue #538 describing some of the plotting problems. |
I checked the sst values in the alt02 and alt05 tests, which looked like they were exactly 0. They aren't - there's a spike at the end of the time series which is making the y axis very large. However, the values appear to be constant (-1.90458264992426463) which still didn't make sense, since the mixed layer calculation is turned on. It's right, though - the mixed layer routine is producing frazil ice of different amounts through the run; sst is pegged at a constant value because the salinity is constant, and the run isn't long enough to see warmer sst in this grid cell. So I think the alt02 and alt05 tests are behaving as expected, at least for the mixed layer calculation. |
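The pegged value is consistent with sst sitting at the salinity-dependent freezing point. A quick sanity check, assuming the linear liquidus Tf = -depressT * sss with depressT = 0.054 deg C/psu (the 'linear_salt' form; whether these particular runs use that option or the mushy formulation is an assumption here):

```python
def freezing_point(sss, depress_t=0.054):
    """Linear liquidus: freezing temperature (deg C) for salinity sss (psu)."""
    return -depress_t * sss

# With constant salinity the freezing point (and hence a pegged sst) is constant;
# e.g. sss = 35 psu gives roughly -1.89 C, close to the -1.9046 value seen above.
tf = freezing_point(35.0)
```

So a constant salinity in that grid cell would hold sst at one constant value for the whole run, exactly as the plots show.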
cice.runlog.201210-072851.gz |
Thanks @rallard77, that's good. I've added the list of to-do items above to the top of the PR, as a checklist. |
jra55 tests for different grids:
gx3 uses jra55_gx3
gx1 uses jra55_gx1
tx1 is already using jra55 by default (no changes to tests required)
gbox* uses internal forcing, no jra55 data (no changes to tests required)
remove ocn_data_dir='default' and bgc_data_dir='default' in set_nml.jra55* files because they cause the bgc tests to fail. the bgc tests work with WOA bgc forcing data with jra55 physics forcing data.
iobinary tests must use the old gx3 binary forcing; the jra55_gx3 netcdf data causes an abort because iobinary is tested with netcdf turned off. the old gx3 data is NCAR_bulk in binary format.
alt01 with jra55 uses the alt01a setting. the difference is that use_leap_years is set to false in alt01a. It's set to true by default in the jra55 datasets.
set_nml.gx3 and set_nml.gx1 are now JRA55 by default
get rid of set_nml.jra55* files
add set_nml.gx3ncarbulk and set_nml.gx1coreii, which invoke the prior forcing datasets
modify the test suites to do identical tests but with new options; compare to the prior implementation by creating a renamed regression baseline via a script and links to generate the new names
use zz* for the new set_nml options temporarily just to make it easy to generate the new links (the zz ensures the new options are last in the list alphabetically)
chmod -x all set_* files for consistency
this is done as a clean separate step so new baselines can be established to continue to evolve the test cases and allow regression testing
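The "script and links" trick above can be sketched as follows. The name mapping and directory layout here are hypothetical (the actual renames come from the option changes in this PR); the idea is just to expose each old baseline directory under its new test name so the regression comparison finds it:

```python
import os

def link_renamed_baselines(baseline_dir, renames):
    """For each old_name -> new_name pair, symlink the old baseline
    directory to the new test name so regression tests still compare."""
    created = []
    for old, new in renames.items():
        src = os.path.join(baseline_dir, old)
        dst = os.path.join(baseline_dir, new)
        if os.path.isdir(src) and not os.path.lexists(dst):
            os.symlink(old, dst)  # relative link inside baseline_dir
            created.append(new)
    return created

# Hypothetical example: an old default-forcing test picks up its new zz* option name.
# link_renamed_baselines("/path/to/baselines",
#                        {"smoke_gx3_8x2_diag1": "smoke_gx3_8x2_diag1_zzgx3ncarbulk"})
```

Because the links carry the new names, the renamed tests can be verified bit-for-bit against the prior baseline before fresh baselines are generated.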
I have merged in the latest master to this branch and retested, https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#97b11eac6673ee2fc03f893fbb03e28847bb5f37. Comparison is to the prior set of results for this branch. There are lots of non-bit-for-bit results because Icepack changed answers in the 1.2.4 release, but I think the results are all as expected. The transition from the old version to the new version was done carefully. I would like to propose we consider merging this soon and then creating a separate issue to continue to "fix" coverage and settings that have been identified. I think it would be helpful to start testing the new test suites and allow continued development to work from the JRA55 default. Even #548 is using the old forcing data to test the zsal feature, which is probably not what we want in the longer term. The one thing we will do (either in this PR or in another PR soon) is remove a bunch of the old forcing tests. At the current time, we are basically doing double the testing, all with old forcing and all with new forcing. Should I remove most of those tests now? |
I just committed a change to remove most of the old forcing test cases. So we're back to a more typical-size test suite (150 from 274), https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#97b11eac6673ee2fc03f893fbb03e28847bb5f37 |
@eclare108213, are you open to merging this PR now/soon? The focus here is on refactoring the test suites and making JRA55 the default. Once this is merged, I feel comfortable that we can carry out concurrent development on multiple tasks, including tasks associated with further validating the JRA55 output. I worry that if this PR is delayed, concurrent development will happen anyway and we'll end up with a bit of divergence between this branch and other tasks. I think the only reason to delay a merge is if we question whether JRA55 should be the default forcing in the short term. |
I don't understand why set_env.box2001, set_nml.box2001 and set_nml.gbox80 say "Empty file" in the Files Changed tab. Is that just a github bookkeeping weirdness?
What's the reason for having alt01a now? I know that it uses leap days now and didn't before - is that the only difference?
I'm generally okay with merging this PR soon and continuing the work elsewhere.
The empty files just change permissions on the files to make them consistent with the rest of the files. There are no internal changes. The alt01a is a good catch. That was needed when both old and new forcing were being tested. Let me rename that file to alt01, update the test suites, rerun the suite, and then commit the change. I'll let you know when I've fixed that. |
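For reference, the permission-only change amounts to clearing the execute bits (`chmod -x`) on each options file. A small sketch of the equivalent operation; the example path is one of the files mentioned above:

```python
import os
import stat

def strip_exec_bits(path):
    """Clear user/group/other execute bits, equivalent to `chmod -x path`."""
    mode = os.stat(path).st_mode
    os.chmod(path, mode & ~(stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH))

# e.g. strip_exec_bits("configuration/scripts/options/set_env.box2001")
```

Since only the mode bits change, git shows the file content diff as empty, which is why those files appear as "Empty file" in the Files Changed tab.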
I've renamed alt01a to alt01 and run a full test suite, https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#0eeefcfd19ee448cef3315caf9e2a615b6098b42 |
I moved the checklist to #249 |
This is a work in progress
@eclare108213 edited to add list of to-do items below.
PR checklist
Short (1 sentence) summary of your PR:
Update CICE to make JRA55 the default forcing dataset
Developer(s):
rallard77, apcraig
Suggest PR reviewers from list in the column to the right.
Please copy the PR test results link or provide a summary of testing completed below.
First step was adding all the new jra55 tests and making sure everything passed. See below for more details:
https://github.com/CICE-Consortium/Test-Results/wiki/cice_by_hash_forks#510221b8e9e59b497a5a92bdd4872cb02aae396b
How much do the PR code changes differ from the unmodified code?
Does this PR create or have dependencies on Icepack or any other models?
Does this PR add any new test cases?
Is the documentation being updated? ("Documentation" includes information on the wiki or in the .rst files from doc/source/, which are used to create the online technical docs at https://readthedocs.org/projects/cice-consortium-cice/. A test build of the technical docs will be performed as part of the PR testing.)
Please provide any additional information or relevant details below:
See also #530
First Step, add all the new jra55 tests, 510221b
jra55 tests added as follows for different grids
gx3 uses jra55_gx3
gx1 uses jra55_gx1
tx1 is already using jra55 by default (no changes to tests required)
gbox* uses internal forcing, no jra55 data (no changes to tests required)
remove ocn_data_dir='default' and bgc_data_dir='default' in set_nml.jra55*
files because they cause the bgc tests to fail. the bgc tests work with WOA bgc
forcing data with jra55 physics forcing data.
iobinary tests must use the old gx3 binary forcing, the jra55_gx3 netcdf data
causes an abort because iobinary is tested with netcdf turned off. the old
gx3 data is NCAR_bulk in binary format.
alt01 with jra55 uses the alt01a setting. the difference is that
use_leap_years is set to false in alt01a. It's set to true by default
in the jra55 datasets.
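Since the only difference in alt01a is use_leap_years, the relevant calendar behavior is just the Gregorian leap rule that a use_leap_years=.true. run applies when stepping through the forcing. A sketch of that rule (not the model's actual calendar code):

```python
def is_leap_year(year):
    """Gregorian rule: divisible by 4, except centuries not divisible by 400."""
    return year % 4 == 0 and (year % 100 != 0 or year % 400 == 0)

def days_in_year(year, use_leap_years=True):
    # With use_leap_years=.false. every model year has 365 days (no Feb 29);
    # with .true., leap years have 366, matching the JRA55 forcing records.
    return 366 if use_leap_years and is_leap_year(year) else 365
```

Mixing the two settings would misalign the model calendar with the forcing dates, which is why the option has to match the dataset.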
To-do (copied from conversation below):
Notes from global/ (these are hemispheric values produced by print_global=T):
* Turn on the brine tracer for these tests (when it's off, maybe don't print this diagnostic)
* Should there be rain water in the Arctic in Jan/Feb/Mar?
* The arwt tot mass, energy, salt diagnostics are wrong
* Spikes in the error plots could help locate conservation leaks
Notes from point/ (these are values for particular lat/lon points produced by print_point=T):
* Arctic SST is wrong: large spikes, 2 lines, y-axis values are the wrong magnitude (might just be a diagnostic issue)
* Why is the Antarctic shortwave radiation sum = 0 in mid summer?
* Why is the snow change always 0?
The following issue may be more about the physical representation than the diagnostic calculations. From the point plots:
* Why does gx1 Antarctic heat used go to an approximately constant, nonzero value in mid-Feb, while the gx3 behavior is the opposite?